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A method for in vivo prod -c tier, cf a library ir. ceils c:mp' 
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machinery. Tne genetic clement comprises i) ar. cricin cf replica: 
e Z a gene conferring resistance towards an antibiotic, iii) a <;:;;e t 
a DNA :ecuen:: encoding a desired variant of a rev. "er:;Je c' m: 



a multitude of mutated genetic elements, wherein an crror-prcr.e 
cf a r.enctic element independently of the host chromosomal replication 
ticn frcrn which replication is initiated, ii) optionally a genetic mx-Jcer. 
mccdmo, the polypeptide of interest. Also methods for the generation cf 
ere::. and for the determination cf such a DNA sequence arc describe, j 
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5 PTTTT.D OF V TF. TNVF.NTION 

The present invention relates to methods for in vivo production 
of libraries of polypeptide variants, the screening of these 
variants and selection of those exhibiting desired properties, 
xo The invention furthermore relates to methods for producing the 
desired polypeptide variants. 

partfrtRnilND ™ ™~ TMVFINTION 

15 An increasing number of polypeptides, including enzymes and non- 
enzymatic proteins, are being produced industrially, for 
various industries, household, food/feed, cosmetics, medicine 
etc. One of the major sources for these proteins is and have been 
microorganism found in nature. 

20 u -ino-ina nolvoeptides with new and 

Th» classical approach for anaing poiypep 

. , have b—n to screen wild type organisms 

special properties, have o__n l. 

h as b»en a very successful way o. 
present in nature. inis has D.en r 

, n^ DC -n b» used in such diverse areas as the 
procuring polypeptides uo o_ ui>eu 

25 above mentioned applications. 

How-v-r, often it has not been possible to produce such 
peptides in sufficient amounts because the quantities 
produced in the natural host systems were too minute to allow a 
30 "reduction, and even if the cost was no problem, d«£xc«ltx.. 
'could be encountered in providing sufficient amounts in relation 
to the demand {e.g. human growth hormone) . 

^ a i a ra= r= a -ee been overcome by the advent 
Such problems have to a larg- — g-e- 
35 of recombinant techniques for the production of polypeptides, in 
this art polypeptides are produced by the use of boreal 
system. Genes encoding certain polypeptides are cloned ana 
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transferred into cells that will produce the polypeptides in 
quantities much larger than those, wherein they are produced in 
the original organism. Over the latest twenty years a larg- 
number of methods for the production of polypeptides according to 
5 such techniques have been developed. 

Often, proteins from natural sources do not meet the requirements 
for certain applications, and it will be necessary to modify 
existing proteins towards certain activities or biophysical 

10 properties. 

It is possible to generate new variants of a protein by classical 
mutagenesis of the microorganism using radiation (X-ray and UV) 
or chemical mutagens. However, since this approach is a very 
is labour and -time consuming process, in the same last two decades 
researchers have been developing improvements on existinc 
polypeptides by using more specific and selective recombinant 
techniques, such as protein and genetic engineering for creatine 
artificial diversity. 



20 



Based upon considerations using knowledge of the structure- 
function relationships and general protein chemistry, researchers 
have come a long way m designing polypeptide variants exhibiting 
improvements in various properties. 

23 

However, it has also beer, realised that the various interactions 
into which polypeptides take part, are so complex that rational 
design according to such knowledge has serious limitations, and 
in recent years methods employing random mutagenesis followed by 
30 screening of or selection from very large numbers of variants 
produced therefrom has gained interest. 

For this purpose a microbial library of mutants is generated for 
subsequent expression ana screening to determine variants 
35 possessing the desired properties. 
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Over the - years many both in vitro and in vivo DNA mutagenesis 
techniques for creating high numbers of different variants of 
polypeptides have been developed. 

5 Considering the fact that a typical naturally occurring 
polypeptide consists of between 100 and 1000 amino acids, and 
each may be varied in 20 ways (only to stay within the naturally 
occurring amino acids), the number of possible variants for a 
soecific polypeptide is enormous. Since the main parameter that 
10 defines or measures the usefulness of a microbial collection or 
library used to identify improved variants of polypeptide is the 
number of different variants, N, which is comprised in the 
collection, a need for large libraries has emerged. 

is Especially in cases when a powerful selection system is 
available, the limiting factor for the identification of the 
desired polypeptide is the size of the library. 

In in vitro systems the practical, state of art, limit for N is 
20 about 10 8 . This is mainly due to inefficiency of transformation 
(introduction of DNA into the cell) of the manipulated DNA into 
the host organism. This number varies a lot from organism to 
organism: in the presently best case, E. coli, the usual 
efficiency of transformation cf in vitro manipulated DNA, e.g. a 
2 5 iiaation of DNA fragments or chemical treatment of DNA, leads at 
the most to library sizes up to 10 e bacteria (Greg Winter, 
Current methods in Im-nunology 5: 253-255, 1993). Very few 
examples of libraries of this size have been reported. 

30 In vitro library constructions in other prokaryotes, such as 
Bacillus sp., Streptococcus so. or Staphylococcus sp. will for 
practical reasons be orders of magnitude below this number. 

Considering eukaryotic hosts such as Saccharomyces cerevisiae or 
35 various Aspergillus sp,, an even lower number of transf ormants 
can be expected from in vitro manipulated DNA. 
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A special case of a large library has been reported based on in 
vivo recombinati on between libraries of antibody light and heavy 
chains based on a specially designed system useful for that 
5 particular case (Griffiths, A.D. et al . , 1994, EMBO J. 14: 3245- 
3260) . 

A number of methods are available to generate variants of a 
polypeptide in microorganisms in vivo, ranging from very simple, 

10 such as treating cells with chemical or physical mutagens, to 
rather complex, relying on cells that contain an error-prone DNA 
polymerase but lack the mismatch repair system which corrects the 
errors {Stratagene, XLl-red (mutS, mutD, mutT) Catalog #200129) . 
But these techniques have a major drawback as the mutagenesis is 

is not targeted to a specific part of the genome (coding for the 
polypeptide of interest) and high frequencies of mutations are 
generated also in essential genes for the cell as well as ' in the 
target gene, resulting in massive cell death, together with a 
high number of cells, where the mutations do not influence the 

20 polypeptide of interest. Such "noise" will limit the accumulation 
of mutations in the target region. 

It is therefore the object of the invention to provide an in vivo 
target region- specif ic mutagenesis procedure in order to produce 
25 very large numbers, N , of polypeptide variants. 

A second object of the invention relates to the screening or 
selection of variants with the desired properties, both by 
existing and future technologies. 

30 

SUMMARY OF THE INVENTION 

The present invention therefore relates to a method for in vivo 
35 production of a library in cells comprising a multitude of 
mutated genetic elements, wherein an error-prone polymerase is 
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used in ' each ancestral cell to replicate all or a part of a 
genetic element comprising 

i) an origin of replication from which replication is 
initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
resistance towards an antibiotic, 

iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

The invention furthermore relates to a method for the generation 
of a DNA sequence encoding a desired variant of a polypeptide of 
interest, wherein 

i) a mutant library is produced by the above method, 

ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce 
polypeptide variants, 

said variant polypeptides are screened or selected for a 
desired property, and hosts producing such desired 
variants identified and isolated, 

said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant. 

and a method for the determination of the DNA sequence encoding a 
desired variant of a polypeptide of interest, wherein 

(i) a mutant library is produced by the above method, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides, 

(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

(iv) said genetic element ir. said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant . 



in) 



iv) 
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The screening of the library or the selection of the variants 
depends on the specific polypeptide and which properties thereof 
it is desired to improve and/or retain. It is therefore necessary 
to set up a screening protocol for each case. Such protocols 
s involving a number of assays are described in the literature 
(Clackson et al . , Nature 352:624-528, 1991, Bryan, P et.al., 
Proteins 1:326-334, 1986). 

An elegant approach to the combination of the generation of 
o diversity and the selection of variants with the desired 
properties would be a combination of the in vivo method of the 
invention for generating the diversity with a phage display 
system (Greg Winter, Supra) . 

5 A specific example of a polypeptide of interest is the alkaline 
proteases used in the detergent industry for the removal of 
proteinaceous stains from fabric. In that case the screening may 
be performed in actual detergent compositions to investigate 
properties such as thermal stability, oxidation stability. 

) storage stability, substrate specificity and- affinity, stability 
to non-aqueous solvents, pH profile, ionic strength dependence, 
catalytic efficiency, and wash performance. 

Furthermore the invention relates to a process for the production 
of a desired polypeptide variant, wherein 

(i) a DNA sequence encoding a polypeptide of interest that has 
been determined according to the method above is intro- 
duced into a suitable host in a manner whereby it can be 
expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iia) said polypeptide variant is recovered. 

Methods for the introduction, of the DNA sequence selected into 
suitable host systems are described in, (Sambrook et al. 
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Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Lab., 
Cold Spring Harbor, NY.) . 

It is also within the abilities of the skilled person to select 
5 suitable growth media and other conditions for the host system 
selected that are conducive for the expression of the polypeptide 
variant od interest. Guidance hereto may f. ex. be found m 
(Sambrook et al . , supra). 

io Also for the recovery of the polypeptide a large number of 
methods are available for the separation and purification or 
proteins, e.g. in (Scopes, R.K., protein Purification (1987), 
Springer-Verlag) 

is Lastly, the invention relates to the polypeptides produced by the 

above method. 



HPTATLED DE crpTPTTn ^ OF THE I^E!^lIQN 

Th- invention comprises a method to construct in vivo libraries 
of variants in a gene of interest. The method involves the use or 
a cenetic element, such as a bacteriophage or a plasmid that is 

t • in^^on'-lv of the host chromosomal, 

able to replicate ir.-e~-..-~ n.iy 

2 s replication system By the use of the possibility to separate the 
.epilation of the host chromosome and the replication of the 
«n-t<c element (ohage/plasmid) , it is possible by modifications 
of o^ replication system to selectively introduce mutations m 
the genetic element (phage/plasmid) keeping the chromosome of the 

30 host intact. This means that the generation of variation m the 
gene of interest does not compromise the viability of the host. 

, . „. nn a nic'-'v accurate process, and the 

DNA replication is a 

«tP of th° r---- omosomal replication in E. coll 
misincorporation rate or cn_ 

^ _ u n-de^- of 10" 10 pr. base pr. round 

35 has been estimated to be m ^ o.a_. P 

ThP base naming carried out during DNA 
of replication. The base pa—'a 

-. , = _ nrefe^ence for the polymerase to 

reDlication leads to a prei.e-.eu 
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incorporate the correct base at a certain position, which 
accounts for approximately 10 5 of the overall replication 
fidelity. If an incorrect base has been incorporated, the 
polymerase will stall and a 3 '-5' exonuclease will often remove 
s the 3' reincorporated base. This part denoted proof-reading 
accounts for approximately 10 J of the overall replication 
fidelity. The repair system of the cell accounts for the last 10 3 
of the fidelity rate. 

io Replication of the chromosome in E. coli is for the most part 
carried out by DNA polymerase III holoenzyme, which is a multi- 
protein complex containing 10 different polypeptides including a 
polymerase (alpha sub-unit, pole gene) and a 3--5 1 exonuclease 
{dnaQ gene) . 

15 

A further polymerase, DNA polymerase I (DNA pol I, po lA gene), 
contains three different activities, viz. a DNA polymerase 
activity, a 3 '-5' exonuclease activity, and a 5 '-3' exonuclease 
activity despite the fact that it is one single polypeptide. This 
20 polymerase has several functions in the cell. Besides DNA repair, 
DNA pol I is also needed for the chromosomal DNA replication, as 
it is involved in the assembly of DNA fragments during synthesis 
of the lagging DNA strand. However,, it replicates only a very 
minor portion of the genome. 

This polymerase is furthermore involved in initiation of DNA 
replication of certain classes of plasmids, e.g. ColEI origin of 
replication-based plasmids such as pBR322 in Escherichia coli and 
Gram-negative bacteria or pAMSl like plasmids in Gram-positive 

30 bacteria. Such plasmids may be able to replicate completely 
through the activity of DNA polymerase I without DNA polymerase 
III being present in active form, e.g. if this enzyme is 
dysfunctional due to genetic causes, e.g. temperature sensitive 
variants at a non-permissive temperature), or under conditions 

35 where only limited amounts of DNA Polymerase III are present in 
the cell. 
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It is well known that certain mutations lead to a decrease (or an 
increase) in the fidelity of DNA replication. These mutations 
have been mapped to reside mainly in polymerases, exonucleases or 
5 in elements of the repair system. Unfortunately, most of these 
mutations alter/impair the fidelity rate for the complete genome 
present, and such non-targeted mutations are not desirable. 

One example of such a mutation could be an inactivation of the 
10 3' -5' exonuclease activity of DNA pol I. 

However, according to the invention use is being made of the fact 
that some elements in the replication system may be temporarily 
-switched" off, fully or partially, thereby stopping or greatly 
15 slowing down the replication of the genome, while replication of 
certain genetic elements as defined herein is continued. 

An E. coli strain containing a temperature sensitive DNA pol III 
(i.e. the polymerase a-sub-unit or another temperature sensitive 

20 sub-unit ■ that render the holoenzyme conditionally non- 
functional) , or a function required for initiation of chromosomal 
reolication, such as DnaA, an error prone DNA pol I and a colEI 
based plasmid containing a gene of interest, is an example or a 
g«.„o t ^ system according to the invention designed to speci.fi- 

25 ca llv introduce mutations ir. the plasmid (and the gene of 
interest) . 

in such a system raising the temperature to a non-permissive 
valu- will have the effect that DNA pol III cer.ses to function 
30 fully, while the error prone DNA pol I will retain its function 
and replicates the plasmid with reduced fidelity resulting m 
mutated copies of the plasmid. 

Since the generation of mutations is random, each cell will 
35 generate unioue mutations and upon lowering the temperature, the 
temperature sensitive function will become active again, ana 
normal replication of the cells continue. 
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A variation- would be an E. coli strain with temperature sensitive 
alleles of pollll and poll and an inducible expression (by a ts 
repressor (temperature-sensitive) or by chemical induction) G f 
s the error-prone polymerase. At a restrictive temperature and the 
presence of the inducer mutations will accumulate in the genetic 
element. At permissive temperature and the absence of the 
inducer, the complete systems functions as the wild type cell. 

io Accordingly the invention in its first aspect relates to a method 
for in vivo production of a mutant library in cells comprising a 
multitude of mutated genetic elements, wherein an error-pron- 
polymerase is used in each ancestral cell to replicate all or a 
part of a genetic element comprising 

is i) an origin of replication from which replication is 

initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
resistance towards an antibiotic, 

iii) a gene encoding the polypeptide of interest, 

20 independently of the host chromosomal replication machinery. 

The invention consequently comprises a method for in vivo 
production of a library ir. cells comprising a multitude of 
mutated genetic elements co-prising 
25 A ) providing a cell having 

i) an error-prone polymerase that independently of the 
chromosomal replication machinery of said cell will 
replicate all or a part of a genetic element 
comprising 

30 a) an o-igin of replication from which replication 

is initiated, 

b) optionally a genetic marker, e.g. a gen- 
conferring resistance towards an antibiotic, 

c) a gene encoding the polypeptide of interest, 
35 and 
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B) 
C) 



E) 



11 

ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 

functional , 

growing such a cell under conditions conducive to its 
replication to obtain a multitude of ancestral cells, 
reversibly inducing said chromosomal replication machinery 
in said ancestral cells to be substantially non-functional 
for a period of time sufficient to allow for the 
replication of said genetic element by said error-prone 
polymerase to generate mutations in said genetic element, 
reversibly inducing said chromosomal replication machinery 
in such mutated cells to be substantially functional, and 
growing such mutated cells "under conditions conducive to 
their replication. 

in this context the expression "mutant library" means a set -of 
cells, bacteria or phages (typically 10' to 10» cells or phages) 
that differs with respect to one particular gene encoding a 
polypeptide of interest. Typically one would like to introduce 
o one or more different amino acid alterations in this particular 
polypeptide in each member of the library. 

in this context the expression "error-prone polymerase" means a 
po^rase that during DNA replication will incorporate mistakes 
s to- of the wrong nucleotides in a given position or cause a 
action or an insertion of one or- several nucleotides) with 
h'ioher frequency than the polymerase normally used for this 
purpose (e.g. E . coli DNA pol 1, Bacillus subtilis DNA pol I, T4 
DNA polymerase, T7 DNA polymerase) . 

30 Th~ egression "ancestral ceil" here means such cells wherein no 
mutations have been introduced. In some embodiments of the 
invention the mutation cycle may be reiterated, and in that case 
such cells that were initially cutated become ancestral cells for 

35 the second mutation cycle, etc. 
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In this context the expression "host chromosomal replication 
machinery- means the DNA polymerase or DNA polymerase holoenzyme 
that is mainly responsible for the replication of the host 
chromosome, e.g. DNA polymerase III in E. coli. 

5 

in this context the expression "genetic element" means a small 
(from 1 or 2 kilo bases to 100 kilo bases) entity consisting of 
RNA or DNA, that is able to replicate independently, i. e it 
contains an origin of replication. The genetic element would 
io typically be a bacteriophage, a phagemid, or a plasmid. The 
genetic element must also according to the invention comprise a 
gene encoding the polypeptide of- interest, and it may further 
comprise a genetic marker, e.g. a gene conferring resistance 
towards an antibiotic. 



15 



A virus, a retrovirus, or a transposon that is able to replicate 
independently of the host replication machinery, eg 
retrotransposons could also be used as the "genetic element". 

o Kim and Loeb (1995, PNAS 32 634-58B) have demonstrated that HIV 
reverse transcriptase (KIV-RT) is able to complement E. coli DNA 
Pol I with respect to chromosomal DNA replication and initiation 
of plasmid DNA replication . The reincorporation rate of HIV-RT 
(ar.a related retroviral reverse transcriptases) is several orders 

s or magnitude higher than tr.e rate of DNA pol I, i. e . i 0 -' to io- 
reincorporations P r . base pr. round of replication. The use 
sucr. a polymerase in stead of a mutated error prone E. coli DMA 
pol I in an embodiment of the invention would significantly 
increase the frequency of replication errors in the system 
o described above. 

In a further embodiment of : :, invention the mutation cycle 
described above can be reitera-.ed. i.e. the mutagenic polymerase 
switched on and off several ti.es, thereby generating even more 
> mutants. Such a step could furthermore help the segregation of 
plasmids if a multicopy plasrcid is used as the genetic element 
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in certain genetic elements one can envision that only the part 
of the genetic element located in the vicinity of the origin of 
replication is replicated by the error-prone polymerase. In such 
s cases, the gene of interest should be situated withxn this 
region . 

in a specific embodiment of the invention the genetic element is 
a phage, wherein the gene encoding the polypeptide of interest is 

io positioned at a locus where the polypeptide upon expression is 
displayed from the surface of the phage, whereby a screening can 
be performed directly (see Greg Winther, supra) . To ensure the 
correspondence between DMA sequence of the phage and the protein 
displayed the primary phage stock should be passed through wild 

15 type E. coli, at low multiplicity of infection, prior to 
selection or screening. 

t-vw. freau-r-v of mutations, the method of the 
To further increase the trequ-r.-y u . , • 

. j irrDnr , where the method is used m 
invention comprises embodiments v.nere 

t ,i*\, * reoa^ deficient host, e.g. mutL, mutS, mutH, 
20 conjunction with a repa_~ u_i--v- 

or a combination of mutator genome types. 

h . e „„-„^n "-eDair deficient host" means a 
In this context the express-— 



ore alterations in genes encoding 



cell containing one cr 

• t-o b« d<-«ctlv or indirectly involved in the DNA 

>~c " e ■ ns known to ^ ^ - 

" t*. ^ c=— h nutations is that a higher frequency 

eoa -r The result or sj^n macauxuii^ 

^ rnw r-o nolvm^rases, chemicals, X-ray, UV 
f introduced mutations (oy t^e poiyia^a 



25 p 

reoair 
o 

light, etc.) will not 



be repaired and will be "permanently" 



incorporated in the genome, the so called mutator phenotype. 
30 Examples of such genes are mutL, mutS, mutU, inutT. 

-i~ m =>n- n r« could as indicated above use a 
As the genetic element one c~— u ^ 

< f = rO-^-.c ^ order to couple the variant 
phagemid in steaa of a plos.^c 

generation to a display system, e.g. M13, fl, fd. 



35 

In 



this context the expression "phagemid" means a plasmid that 
besides its plasmid origin of replication contains a phage origin 
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1) 
2) 
3) 



of replication, phagemids are dependent of the conditions able r 

plasmid - - * — — : 

5 :^:::i:ir em wouid invoive — - ■ 

a plasmid origin of replication, e.g. ColEI 
a M13 phage origin of replication 

a chimeric gene consisting of the. gene of interest fuse - 
to the gene encoding cm protein . (0r ^ p ^ ™ 
Acid. Res. 21: 4491-4498, 19 93 ). 

The fi rsc step would be the generation of dive^ k 
grow lng / maintaini ot diversity by 

» ^ as desc ; ibed abo „ Th r r :: on r a ;: formed ; ith chis 

^.ction with the he lp er p ha„ ln ^l/^ * 
branded ph ag .„id that will b- Ja-^d ,° n „ Si " 9le 
Pha 9 es discing the variant "» 

« t • ^ LGins ca n then be subiprt-o^ 

selection procedure. sheeted to a 
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3»0°; a"";!"' 8 "" 10 " 33 " SUCh " " ^ " ^ or 
3-0., and Ph l23 ln Bacillus cortain their ^ po ,^ rases 

» containing an error-prone D..A polymerase . ' ^ 

^Lty s :r ec r; f rr ion th ' e e — p — - 

30 variant of E coii nva i P ^ fe ^ ea e «or-prone polymerase is a 
' e I or HIV reverse transcriptase. 

As a polypeptide of interest p 

couxd.be mentioned. Sn ong th-s- ^1 blol °^«l activities 

3S blood-clotting factors ^ Z h T"' ° DeS ' "^"^ 

3 actors, anti- microblal agents, and other such 
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, for t he prophylaxis and treatment of 
Among such industrial ^ oxid oreductases, trans- 

— 1 rri,sr^ 

f erases, phytases, 

isomerases, lyases, and ligases. 

h , he expression "carbonyl hydrolase" means enzymes 
in this context the expressi C( _ 0 )-X group, where X is 

that hydrolyze compounds containing a -C(-O) g 

oxygen or nitrogen. 

of enzym-s belonging to the group of carbonyl 
Specific classes of enzym ^ ^ ?eptide 

5 hydrolases are such as hya 

hydrolases (proteases) . 

n . as en-ynes classified under the Enzyme 
Proteases are here mean, as^ e. -> ^ ^ accordance with the 

""""Ttxl (T^Tof "he International Union of Biochemistry 
o Recommendations U"- 1 ' 

and Molecular Biology ( IUBM3) . 

• c elected from those classified under 
Examoles include proteases select 
the Enzyme Classification (E.G.) manners: 

^c-^e" axinopeptidases) , including 3.4.11-5 
U - S - a " 3 4H9 (X-pro aminopeptidase) , 

(Prolyl aminopeptxdase) , • • ^ nopeptida8e) , 3.4.11.12 
3.4.11.10 (Bacterial * * 15 (Lysyl aminopeptidase), 

(Thermophilic aminopeptidase), • • (Methionyl 

v,r,i,i aminooeotidase) , j . * - j. •>■ • ■>■ 
30 3.4.11.17 (Tryptophanyl ammo?-. 

aminopeptidase) . 

npri serine er.dooeotidases) , including 3.4.21.1 
3.4.21 (i.e. so-callea serxn. e (CuC umisin) , 

(Chymotrypsxn) , 3-*; -- '* (Cer evisin) and 3.4.21.62 

35 3.4.21.32 (Brachyunn) , 3.-.^-- 

(Subtilisin) ; 
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3-4.22 (i.e. so-called cy Steine endopeptidases) lnrl „■ 
3-4.22.2 (Papain), 3 4 22 3 f P w, ' lnCludln S 

3-— (Asclepain)' 3 4 ^TV;-'^- 
5 (Caricain) and3. 4 , 2 . 31 i^^" 3.4. 22 . 30 

3.4.23 (i. e . so . called aspart . c end 

3-4.23.1 (Pepsin A) 3 4 23 18 fa • , , "eluding 

(Penicillopepsin; ^ 23 25 { ^ rgUlqi T ia " ' 

-*.-*j.25 (Saccharopepsin) ; and 



10 

3 .4 .24 



™» Prot e ase . proceina ln K , J;;:;™' /r lysin ' Bicillus 

^ -release TW7, and Protease TW3 . 

» SP-itic exa^pies of such readily available ^ 

include Esperase® ^ Clal Proteases 

Py-se @ Panc ^ @ ' NSUtraSe@ ' Dyr32 ^' S -inase @ , 

J: r o' ( 6 TrYP51n N0V ° (PTN) ' Bi °-— >«>• Clear- 

-ro (all enzymes available fro. Novo Nordisk A/s, . 

2S Examples of other co7 n -ri,l „ . 

Maxacal® m ^ " Proteases include Maxatase®. 

Maxacal®, Maxapem® marketed by Gist-Brocades N v Qpticl n 
marketed by Solvav et a* „ n °Pticlean© 

y oivay et Cie . and Purafect® marketed bv p»„ 
International y Gen encor 



30 It 



protease variants are i - interest. Examples of suc h 

Aancs ar e disclosed in Ed 130 ica m 
214-435 (Henkel), WO 87/0446, T ^nentech) , £P 

251.44, (cenencor EP 26 \" ^ ' , ^ 87/ ° 5 ° 5 ° (G — « 

3S Nature. 318 p 5 37s Th ' ^ ^ al " < 198 ^ 

' P - 375 -376, Thomas et al . , (i 987 ) T M . 

I", pp. 803-813, Russel et al ( 198 7, J ' ^ " ' 

al., (1987), Nature, 328, p. 496-500, 
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, ,« (Cenex) WO 88/08033 (Amgen) , WO 89/06279 (Novo 
WO 88/08028 (Genex) , WU Nnrdisk A / S ) , EP 525 610 

Hordis, A/S), WO 91/00345 (Novo Nord.sk 
(Solvay) and WO 94/02618 (Gist-Brocades N.V. ) . 

f nroteases can be determined as described in 
5 The activity of third edition , 1984 , Verlag 

-Methods of Enzymatic Analysis , 

Chemie, Weinheim, vol. 5. 

anf as enzymes classified under the Enzyme 
Lip ases are here mean, as ^ ^ Ester Hydrolase s) in 

10 "^""rrjte R c Lendations (1992) of the Xntemational 
accordance witn cne ^ c /tttomtu 
Union of Biochemistry and Molecular B.ology (I<WB) . 

. , „ selected from those classified under the 

Examples include lipases select 

15 Enzyme Classification (E.C.) numbers: 

311 (i . so-called CarboxyUc Ester Hydrolases,, including 
O.Li^'Triacylglycerol lipases, O . ! . 1 .4 . . Phospholipase 

, of lipases include lipases derived from the following 
" "a^ The indrcatec patent publications are »- 
corporated herein ^by ^ „ ^ _ 

Hunucola, e.g. n. 

thermoidea and H. insoler.s (US 4,810,414) 

„ 0 Ds f^agi, Ps. stutzeri, Ps. cepacia and Ps. 

?SeUdOT ° na£ ' 9 R ; /0£3 ; 1) 0 . ps. plantarii or Ps . gladioli (US 
fluoresces l«0 89/04361), o. . P alcaligenes and Ps . 

. mo a 950 417 (Solvay enzymes)) or Ps . aicaa y 
paten, no. ^ 50 < 4 mendocina (w0 88/ 09367; US 

pseudoalcaligenes (EP 218 2 

30 5,389,536) . 

06*) or F. solani pisi (WO 
Fusarium, e.g. F. oxysporum -,-°°-> 

90/09446) . 

35Mucor ' ul so called K^omucorKe.g.H.mienei ( EP ,38 0,3,. 



WO 97/25410 

PCT/DK97/00014 

18 

Chromobac Cerium (especially C. viscosum) 
Aspergillus (especially A. niger) . 

s Can d i da , e . g . c . cyiindracea (also caUed c ^ 

(WO 88/02775, or C. anta.ctica l ipase A or B (WQ 
94/01541 and WO 89/02916) . 

Geotricum, e.g. G . candidlm efc 
10 Biochem., 106, 383-388, ' " 

iT«.«T: e " p ' ~ rtii (Yama9uchi etai - — 

- Scopus, e.,. deiemar {Hass ec ^ (1991)< ^ io? _ 
113) or R. niveus (Kuginuya et aI . , (1992) fliosc ._ 
Sioche^ 56, 716-719, or R. oryzae. 

Bacillus, e.g. B. subtilis (Dartois et al (1993) py K . 

20 Biophysica acta 1131 253-?goi n ~ B 

, . ' 253 260) or 5 - stearothermophilus (j? 

64/7744992) or B . pu^ilus (WO 91/16422,. 

Specific examples of >-<=adi"iv aua n=Ku . , 

T . , ^ ava ilable commercial lipases includ- 

Lxpol..... Lipolase , Ble „, Lipozyrae@i p>UtMe9 _ ^ 

= s Lecitase® (all available fro™ Novo Mordisk A/S) . ' ' 

Espies of other l ipases are Luma£ast . ps . upas= 
fro™ Ceneucor Int . lnc . ; Lipomax ™ ps pseudoalcaligsnes I 
r» 01 « Broca des / G e n encor Int . lne ., Fusaritm 5o2imi 

30 <C„ tl „ ase) £ron UnUever; BaciUus ^ upase P - 



enzymes. Other Upases are available fro™ other co„pani 



es . 
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The activity of the lip.se can be determined as described in 
I thods of Enzymatic Third Edition. verlag 

c Lie, «einhein. vol. 4, or as described in AF S 5/5 GB (avail- 
able on request from Novo Nordisk A/S) . 

in this context the expression -carbohydrase- means .XI enzymes 
capab^ of breaking down carbohydrate chains (e.g. starches, of 
especially five and si, .ember ring structures (i.e. enzymes 
dassified under the Enzyme Classification number EC 3 
(glycosidasesl in accordance with the Recommendations (1952) 
h mternational Union of Biochemistry and molecular Biology 
I0 BHB„. Also included in the gr6up of carbohydrases according 
1 invention are enzymes capable of isomerizing carbohydrates 
e g. "six member ring structures, such as D-glucose to e.g. five 
member ring structures like D- fructose. 

Examples include carbohydrases selected from those classified 
under the Enzyme classification (E.c.) numbers: 

' (3 2 11) p-amylase (3.2.1.2), glucan 1,4-a- 

g lu cosid,s. .3. 2.1. J). cellulase .3.2.3.4), endo.l,3<4,- p- 
lucanase ,3.2.1.,,. endo-3 . 4 -Mylanase ,3.2.3.8), oext a e 

L . . / -j n - I/) Dolygalacturonase (3.2.l.ibj, 

(3 2 1 11), chitmase (3.2.i.i^>, ^ Ui ^ 

' /o o i nil a-qalactosidase 

o n 17 ) B-clucosiaase (3. 2.1. 21). a y d * dL 

* - ■ - <-> 9 i 93) ■ amylo-1, 6-glucosidase 

(3.2.1.22), p-galactosiaase (3.2.1.23), ay ' y 

(3 2 n 33), xylan 1 , 4 -xylosidase (3.2.1.37), gxucan enoo-1 3- P - 



D . gluC osidase 13.2.1.39). a-dextrin endo-l.e-gluco-xd... 

(3.2.1.41), sucrose a- g lucosidase (3.2.1.48). glucan endo- 3-«- 

■* « (3 2 159), glucan 1 , 4 -p-glucosidase (3.2.1.74), 
30 glucosidase (3 . / . J- ■ > y 

0 i (3 2 1 75), arabinan endo-l,5-a- 
alucan endo-1 , 6 -p-glucosidase (3.^.1. isi. 

9 . A ,,2 199) lactase (3.2.1.108), ch,tonanase 

arabinosidase (3.2.1.yyJ, 

(3.2.1.132) and xylose isomerase (5.3.1.5). 

1 ^ ra-bohvdrases include a-l,3-glucanases 
t; Examoles of relevan*. caraa.iw.a 

35 Examples harzia ^ ; a-1, 6-glucanases derived from 

derived from Trachoderma harzia..jm, u , a 
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arabinosidase (3.2.1.99), lactase (3.2.1.108), chitonanase 
(3.2.1.132) and xylose isomerase (5.3.1.5). 

Examples of relevant carbohydrases include o-i # 3-glucanases 
s derived from Trichoderma harzianum; a-l, 6-glucanases derived from 
a strain of Paecilomyces ; P-glucanases derived from Bacillus 
subtilis; p-glucanases derived from Humicola insolens; p. 
glucanases derived from Aspergillus niger; P-glucanases derived 
from a strain of Trichoderma; P-glucanases derived from a strain 
D of Oerskovia xanthineolytica; exo-l,4-a-D-gl U cosidases (gluco- 
amylases) derived from Aspergillus ' niger ; a-amylases derived from 
Bacillus subtilis; a-amylases derived from Bacillus 
amyloliquefaciens; a-amylases derived from Bacillus 
stearotherroophilus; a-amylases derived from Aspergillus oryzae; 
a-amylases derived from non -pathogenic microorganisms; a- 
galactosidases derived from Aspergillus niger; Pentosanases, 
xylanases, cellobiases, cellulases, hemi -cellulases derived from 
Humicola insolens; cellulases derived from Trichoderma reesei; 
cellulases derived from, non-pathogenic mold; pectinases' 
cellulases, arabinases, hemi -celluloses derived from Aspergillus 
niger; dextranases derived from Penicillium lilacinum; endo- 
glucanase derived from non-pathogenic mold; pullulanases derived 
from Bacillus acidopullyticus ; p-galactosidases derived from 
Kluyveromycea fragilis; xylanases derived from TrichodexTna 



reesei ; 



Specific examples of readily available commercial carbohydrases 
include Alpha-Gal™, Bio-Feed™ Aloha, Bio-Feed™ Beta, Bio-Feed™ 
Plus, Bio-Feed- Plus, NovozymeS 188, Carezyme®, Celluclast®, 
Cellusoft®, Ceremyl®, Citrozym™, Denimax™, Dezy^e™! 
Dextrozyme™, Finizym®, Funcamyl™, Gamanase™, Glucanex®, 
Lactozym®, Maltogenase™, Pentopan™, Pectinex™, Promozyme®, 
Pulpzyme™, Novamyl™, Termamyl®, AMG (Amyloglucosidase Novo), 
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Maltogenase®, Sweetzyme®, Aquazym® (all enzymes available from 
Novo Nordisk A/S) . Other carbohydrases are available from other 

companies. 

It is to be understood that also variants of such carbohydrases 
are contemplated as the polypeptide of interest. 

The activity of carbohydrases can be determined as described in 
-Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 4. 

Oxidoreductases are here meant to be -enzymes classified under the 
Enzyme Classification number E.G. 1 (Oxidoreductases) in 
accordance with the Recommendations (1992) of the International 
Union of Biochemistry and Molecular Biology (IUBMB) . 

Examoles include oxidoreductases selected from those classified 
under the Enzyme Classification (E.C.) numbers: 

Glycerol-3-phosphate dehydrogenase (NAD*) (1.1.1.8) , Glycerol-3- 
.phosohate dehydrogenase NAD (P) ' (1.1.1.94). Glycerol -3 -phosphate 
1-dehydrogenase (NADP) (1.1.1. 94-). Glucose oxidase (1.1.3.4), 
Hexose oxidase (1.1.3.5), Catechol oxidase (1.1.3.14), Bilirubin 
oxidase (1.3.3.5), Alanine dehydrogenase (1.4.1.1). Glutamate 
d-hydroaenase (1.4.1.2). Glutamate dehydrogenase (NAD(P) 
5 (1.4.1.3), Glutamate dehydrogenase (NADP') (1.4.1.4). L-Amino 

m / i o serine dehydrogenase (1.4.1-7), 
acid dehydrogenase (1.4.1.5), b-nne j ? 

Valine dehydrogenase (NADP') (1.4.1.8). Leucine dehydrogenase 
(14 19), Glycine dehydrogenase (1.4.1.10), L-Amino-acid oxidase 
(lY 3 2). D-Amino-acid oxidase (1 . 4 . 3 . 3 ) , L-Glutamate oxidase 
(1 4 3.11), Protein-lysine 6-oxidase (1.4.3.13), L-lysine oxidase 

u ^ « v ;na=P fl 4 3.16), D-amino-acid 
(1 4 3.14), L-Aspartate oxidase {i.*.3.±o), 

" ' / -i a qq i ) p-otein disulfide reductase (1.6.4.4), 

dehydrogenase (l,4.yy.-U/ — Ul 

Thioredoxin reductase (1.6-4.5), Protein disulfide reductase 

(glutathione) (1.8.4.2), Laccase (1.10.3.2), Catalase (1.11.1.6). 

M in 17) L^oxvcenase (1.13.11.12), Superoxide 
35 Peroxidase (1.11.1./), L -i- J ''. v - 

dismutase (1.15.1.1) 



10 
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Said Glucose oxidases may be derived from A S per g il lus „ i?sr . 

Said peases „> y be deri ved fro™ Mypmu pinsltus 
Mycelaophtora t^ophila, co^nus cinereus, teM0Bia ' 
. K^octonia praticola. S<yttJJ<Hu . ^ ' 

vernicifera. 



Bilirubin oxidases may be derived from ^ro^Ceoi™ verrucaria. 

The Peroxidase may be derived fro™ Soy bean. Horseradish or 

Coprinus cinereus. 

The Protein Disulfide reductase .ay be any mentioned in any of 

the D.< patent applications no. 768/93 265/94 „„,, 

» A / S) , which are hereby inco ; 0 ; at 2 ; d 5 ^ r er r 

ding Protein Disulfide reductases of bovine oriain n 
Oi-ulfiC reductases derived fron ^ U ^ 

^Ilus n Iff er, and DsbA or Dsbc derived from Esckerictia coli. 



Specie examples of readi ly available commercial oxidoreductases 

nclude Cluzyme™ (enzyme available from Novo No rdi sk A/ 
However, other oxidoreductases are available from others. 

It is to be understood that also variants of oxidoreductases are 
25 contemplated as the polypeptide of interest. 

The activity of oxidoreductases can be determined as described • 
"Methods of Bnzymat.c Analysis.., third edition, ^ 
Chemie, Weinheim, vol. 3. Verlag 



30 



in th 1S context transferases are enzymes classified under the 
Enzyme Classification number E . c . 2 in accordance ^ * 
Recommendat.ons (1992) of ths : : ,,rnationai Union of Biochemis £ 
and Molecular Biology (IUBMB) . locnemistry 

35 
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The transferases may be any transferase in the subgroups of 
transferases: transferases transferring one-carbon groups EC. 
2 l) • transferases transferring aldehyde or resxdues (E.G. 2.2); 
acyltransferases (E.G. 2.3); glucosyltransf erases (E.G. 2.4); 
transferases transferring alkyl or aryl groups, other that methyl 
groups (E.G. 2.5); transferases transferring nitrogenous groups 

(2.6) . 

in a preferred embodiment the transferease is a transglutaminase 
E.C 2.3.2.13 (Protein-glutamine y-glutamyltransferase) . 

Transglutaminases are enzymes capable of catalysing an acyl 
transfer reaction in which a y- carboxyamide group of a peptide- 
bound glutamine residue is the acyl donor. Primary amino groups 
in a variety of compounds may function as acyl acceptors with the 
subseouent formation of mono-substituted y-amides of pept.de- 
bound" glutamic acid. When the epsilon-amino group, of a lysine 
residue in a peptide-chain serves as the acyl acceptor, the 
transferases form intramolecular or intermodular y-glutamyl-*- 
lysyl crosslinks. 

Examples of transglutaminases are described in the pending DK 
patent application no. 990/94 (Novo Nordisk A/S) . 

, . an < naS o nav th- of human, aminal (e.g. bovine) or 
5 The transglutaminase may 

mcrobially origin. 

Exa^ies °£ ■<"* transgiuca^inases are .ni«l derived 

transaminases, FXIILi -icrabi.1 transgluta^nases derived 

•> or ,ui,™ (Kle<« et al., Journal of Bactenol- 
o from Physarum polycephalim (Kle... 

ogy 174, p. 2599-2605); transglutaminases derived from Strep- 

• i„^nn qrreo^rvces lavendulae, Streptowyces 
tomyces sp., including S.rep^.-/^ 

lydicus (former Streptcyces l<-cz*i) »nd Streptoverticilliu™ sp., 
including streptoverticilliu. ro3 baraense, Streptoverticilliu. 
35 cin^oneun, and Streptovercic. JHo. griseooameu. (Motoki et 
al US 5.156,956,- Andou et al.. US 5,252,469.- Kaempfer et al., 
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Journal of General Microbiology, 137, 1831-1892; Ochi et a J 
International Journal of Sytematic Bacteriology, 44, 285-292- 
Andou et al . , US 5,252,469; Williams et al . , Journal 'of General 
Microbiology, 129, 1743-1813). 

It is to be understood that . also transferase variants are 
contemplated as the polypeptide of interest. 

The activity of transglutaminases can be determined as described 
io in "Methods of Enzymatic Analysis", third edition, 1984, Verlag 
Chemie, Weinheim, vol. 1-10. 

In this context phytases are enzymes classified under the Enzyme 
Classification number E.G. 3.1.3 (Phosphoric Monoester 
is Hydrolases) in accordance with the Recommendations (1992) of the 
International Union of Biochemistry and Molecular Biology 
(IUBMB) . • ay 

Phytases are enzymes produced by microorganisms which catalyse 
20 the conversion of phytate to inositol and inorganic phosphorus 

Phvtase producing microorganisms comprise bacteria such as 
Bacillus subtilis, Bacillus natto and Pseudomonas ; yeasts such as 
Saccharoses cerevisiae; a,d fungi such as Aspergillus niger 
25 Aspergillus ficuwu, Aspergillus awa^ori, " Aspergillus oryzae, 
Aspergillus terreus or Aspergillus nidulans, and various other 
Aspergillus species) . 

Examples of phytases include phytases selected from those 
30 classified under the Enzyme Classification (E.C.) numbers: 3- 
phytase (3.1.3.8) and 6-phytase (3.1.3.26). 

The activity of phytases car. be determined as described ir 
"Methods of Enzymatic Analysis", third edition, 1984, Verlag 
35 Chemie, Weinheim, vol. l-io, or may be measured according to the 
method described in EP-A1-0 420 358, Example 2 A. 
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I» the present context an anti-microbial polypeptides may be any 
polypeptide exhibiting anti-microbial activities, such as anti- 
fungal, anti-bacterial, and/or anti-inaecticidal activity. 

S Such polypeptides may also exhibit other activities such as 

enzymatic activity. 

Examples of anti-microbial polypeptides according to the 
iO invention include, fungicidal^ active polypeptides derived from 
the mold genus Curvularia described in WO 94/0*459 (Novo Hordis* 
A/ S) , anti-bacterial poiypepcides , described in E P 403 458 
.Kabigen AB) ; anti-microbial proteins isolated from the «irabilis 
seed, described in WO 92/15691 (Imperial Chem Ind. PLC); anti- 
« bacterial polypeptides isolated from an extract of pig small 
s"t in , des^rLd in WO 9 2 / 22 578 .Boman at al., ; polypeptide 
with yeast lethal action accumuiated by yeast of Hansenuia spp. 
as described in JP-60130599 ; Phytolacca insularis antiviral 
protein, which can be used as an anti-microbial described in US 
',» „„. 5.348,6*5 Win Ro LTD.).- bacteriolytic enzymes 
'preparations derived £rora Noc ardiopsis dassonviHei described in 
US patent no. 5.354,681 (Novo Industri A/S) . 

Exa.oles of other anti -microbial polypeptides are maganinin. 

S tr, defensin oseudomycin, mutanolysin and 

25 protegrin, aerensiu, 

acetylmuramidase . 

The oresent invention in a further aspect reXates to a method for 
generating a DNA sequence encoding a desired variant of 
30 polyoeotide of interest, wherein 

(i , " " a mutant library is produced by the above method 
( ii, said library is cultivated under conditions conducive fo 
the expression of said ,,,, of interest to produce variant 
- polypeptides, 
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(iii) said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

the DNA encoding said variants is isolated. 



(iv) 



(i) 
(ii) 



(iii) 



(iv) 



The present invention in a still further aspect relates to a 
method for the determination of a DNA sequence encoding a desired 
variant of a polypeptide of interest, wherein 

a mutant, library is produced as described above, 
said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 
polypeptides, 

said variant polypeptides are screened or selected for a 
desired property, and hosts producing desired variants 
identified and isolated, 

said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 

desired variant. 

This aspect of the invention can be performed by making dilutions 
of the library (e.g. in microtiter plates) and culturing these, 
whereby populations are made each originating from one member of 
the library, and the variant polypeptide produced from each of 
the populations screened for the desired orooerties 
Alternatively the library might be plated on aga'r-plates 
containing a desired growth medium that allows for the screening 
of or selection for desired properties of the variant 
polypeptide . 

If the phage display method is used, the screening or selection 
is performed directly with the phages. 

The criteria used for the selection will vary according to the 
end use of the polypeptide variant of interest, but properties 
typically being tested may include solubility and half-life in 
various media, antigenicity and allergenicity, thermal stability, 
oxidation stability, storage stability, substrate specificity and 
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affinity/ stability to non-aqueous solvents, pH profile, ionic 
strength dependence, catalytic efficiency, and compatibility with 
other components of envisaged end products wherein the 
polypeptide variant will form a part. 

For enzymes to be used in detergents further properties to be 
investigated are, wash performance and compatibility with various 
surfaces, especially fabrics. 



Numerous 



other criteria could be mentioned. 



Upon identification of populations that produce variant 
polypeptides fulfilling the criteria selected, the DNA encoding 
the polypeptide variant of interest is isolated and sequenced by 
use of methods well known in the art. 

The invention furthermore comprises a process for the production 
of a desired polypeptide variant, wherein 

(i) a DNA sequence determined as indicated above is introduced 
into a suitable host in a manner whereby it can be 
expressed in said host, 

(ii) said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iii) said polypeptide variant is recovered. 

The present invention can be used with any cell, especially any 
microbial cell, but it is often suitable to use a prokaryote, 
especially a bacterium, preferably of the genus Bacillus, etc. 

d Among the Bacilli it is preferred to use a strain chosen from the 
group comprising B. lentus, B. lichenifomis , B . 
amyloliquefaciens, B. suhtilis , etc. 

For some uses it is preferable to use a microbial cell which is a 
5 fungus, especially a filamentous fungus, preferably of the genus 
Aspergillus, Tric-hoderma, etc. 
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Among the Aspergilli it is preferable to use a strain chosen from 
the group comprising A. oryzae, A. niger, A. awamori , etc. 

s Among the Trichoderwa it is preferable to use a strain chosen 
from the group comprising T. reseei, etc. 

In yet other situations it is more expedient to use a mammalian 
cell chosen from the group comprising BHK, etc. cells. 



The invention should not be construed to be limited to specific 
examples or embodiments mentioned In the specification above or 
the following examples. 



!S MATERIALS AND METHODS 



s 



EXAMPLES 

20 

EXAMPLE 1 

The system used is an Eschericia coli host cell, which i 
characterized by a number of chromosomal mutations: 
25 i} a Cs (thsrmosensitive) mutation in the pole gene (encoding 

DMA polymerase III, being the main replication polymerase) 
n) a mutation in the polA gene (encoding DNA polymerase I) 
causing an increased error rate by a reduction in the 3'- 
5' exonuclease activity. 
30 iii) repair deficiency by the mutL mutation. 

The target for the in vivo mutagenesis is plasmid pBR322 (colEl 
origin) having either (i) a frar-.e shift mutation, or (ii) a st0 p 
codon introduced into the tet gene, encoding a protein conferring 
35 resistance towards tetracyclin. 
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In each case the repair of the mutation leads to a dominant 
tetracycline resistant phenotype. 

PBR322 contains also the bla gene conferring resistance towards 
s ampicillin. The higher mutagenesis frequency at the target region 
is seen as a higher frequency of tetracycline resistant colonies 
after plating a culture exposed to "mutation-introduction" 
conditions. 

io An E. coli culture grown at 37°C to an optical density of 1 
measured at 600 nm is exposed to 2, 4 or 16 hours at restrictive 
temperature, e.g. 42°C. At these time points dilution series of 
the cultures are plated on LBagar supplementet with 
1) ampicillin (AmpR colonies) 

is 2) tetracycline and ampicillin . (AmpR and tetR colonies) 

The ratio of tetracycline resistant colonies to ampicillin 

resistant colonies indicate the number of cells in the culture 

that contains one copy of a repaired tet gene, indicated one 
20 specific mutagenesis event. 

This means, if a clone have become tetracycline resistant, at 
least one specific mutation has occurred to repair the originally 
introduced gene defect. 



25 
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PATENT CLAIMS 

1. A method for in vivo production of a library in cells 

comprising a multitude of mutated genetic elements, wherein an 
5 error-prone polymerase is used in each ancestral cell to 
replicate all or a part of a genetic element comprising 

i) an origin of replication from which replication is 
initiated, 

ii) optionally a genetic marker, e.g. a gene conferring 
10 resistance towards an antibiotic, 

iii) a gene encoding the polypeptide of interest, 
independently of the host chromosomal replication machinery. 

2. A method for in vivo production of a library in cells 

is comprising a multitude of mutated genetic elements comprising 
A) providing a microbial cell having 

i) an error-prone polymerase that independently of the 
chromosomal replication machinery of said cell will 
replicate all or a part of a genetic element 

20 comprising 

a) an origin of replication from which replication 
is initiated, 

b) optionally a genetic marker, e.g. a gene 
conferring resistance towards an antibiotic, 

O a gene encoding the polypeptide of interest, 
and 

ii) a chromosomal replication machinery that can be 
reversibly induced to be substantially non- 
functional, 

30 B) growing such a cell under conditions conducive to its 

replication to obtain a multitude of ancestral cells, 
C) reversibly inducing said chromosomal replication machinery 

in said ancestral cells t- be substantially non-functional 
- for a period of time sufficient to allow for the 
replication of said genetic element by said error-prone 
polymerase to generate mutations in said genetic element, 



25 
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D , reversibly inducing said chromosomal replication machinery 

in such mutated cells to be substantially functional, and 
E) growing such mutated cells under conditions conducive to 

their replication. 

3. The method of claim 1 or 2, wherein said cell further 

comprises a deficient repair system. 

4 The method of any of the claims 1 to 3, wherein said 

genetic element is a plasmid, a phagemid, a phage, a virus, a 
retrovirus or a retrotransposon. 

5 . The method of any of the claims 1 to 4, wherein said cells 

are microbial cells. 

6 The method of any of the claims 1 to 5, wherein said 
error-prone polymerase is selected from the group comprising DNA 
pel I, DNA pol II, reverse transcriptase and more specifically 
E coli DNA pol I. Bacillus skills DNA pol I, HIV reverse 

, transcriptase, T4 DNA polymerase, T7 DNA polymerase, Phi29 DNA 
polymerase . 

-v,„ ^i = -ir^ l to 6, wherein said 

7 The method of any or the claims l to t». 

chromosomal replication machinery that can be reversibly inducea 
5 to be substantially non- functional is a temperature sensitive E. 

coli DNA polymerase III 

8 A method for the determination of a DNA sequence encoding 
a desired variant of a polypeptide of interest, wherein 

J0 (i) a mutant library is produced by the method of any of the 
claims 1 to 7, wherein said genetic element comprises a 
gene encoding said polypeptide of interest, 
(ii, said library is cultivated under conditions conducive for 
. the expression of said gene of interest to produce variant 
35 polypeptides, 
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(iii) 



said variant polypeptides are screened or selected for a 
deseed property, and hosts producing desired variants 
identified and isolated, 
<iv> said genetic element in said hosts is sequenced to 
elucidate the DNA sequence of the mutant gene encoding a 
desired variant . 



9. A method for generating a DNA sequence encoding a desired 

variant of a polypeptide of interest, wherein 
o (i) a mutant library is produced by the method of any of the 
cla^s l to 7, wherein said genetic element comorises a 
gene encoding said polypeptide' of interest, 

(ii) said library is cultivated under conditions conducive for 
the expression of said gene of interest to produce variant 

> polypeptides, 

(iii) said variant polypeptides are screened or selected for a 
de S1 red property, and hosts producing desired variants 
identified and isolated, 

(iv) the DNA encoding said variants is isolated. 

10. A process for the production of a desired oolypeptide 
variant, wherein " 

(i) a DNA sequence obtained according to claim 9 or determined 
according to claim 8 is introduced into a suitable host in 
a manner whereby it car. be expressed in said host, 

(ID said host is cultivated under conditions conducive to the 
expression of said DNA sequence, and 

(iii) said polypeptide variant is recovered. 

H- A method of any of th» c^imq i m u 

y ui. i.u claims l to 10, wherein said 

polypeptide of interest is an enzyme. 

12. A method of claim li, wherein said enzyme is a carbonyl 
hyarolases , carbohydrases , oxidoreductases , transferases 
Phytases, ligases, lyases, and anti -microbial polypeptides 
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13. A method of claim 12, wherein said carbonyl hydrolase is a 
protease, or a lipase. 

14. A method of claim 12, wherein said carbohydrase is an 
5 amylase, glucosidase, cellulase, glucanase, xylanase, dextranase, 

chi tinase , polygalacturonase , lysozyme , glucosidase , 

galactosidase, xylosidase, arabinosidase, lactase, chitonanase, 
xylose isomerase, pectin esterase, rhamnogalacturonase, endo- 
glucanase . 

o 

15. A method of claim 12, wherein said oxidoreductase is a 
dehydrogenase, oxidase, reductase, Laccase, Catalase, Peroxidase, 
Lipoxygenase, Superoxide dismutase. 

5 16. A method of claim 12, wherein said transferase is a 
transferase transferring one-carbon groups, a transferase 
transferring aldehyde or residues , acyl transferase , 

glucosyltransferase, transferase transferring alkyl or aryl 
groups, other that methyl groups, transferase transferring 

o nitrogeneous groups. 

17. A method of any .of the claims 1 to 16, wherein said cell 
is a prokaryote, especially a bacterium, preferably of the genus 
Bacillus, Escherichia, Staphylococcus, and Streptococus . 

is 

18. A method of claim 17, wherein said Escherichia is a strain 
chosen from E. coli. 

19. A method of claim 17, wherein said Bacillus is a strain 
30 chosen from the group comprising B . lentus, B. licheniformis , B. 

amyloliquefaciens, B. subtilis. 

20. A method of any of the claims 1 to 16, wherein said cell 
is a 'fungus, especially a yeast or a filamentous fungus, 

35 preferably of the genus Aspergillus, Trichoderma. 
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21. ' A method of claim 20, wherein said Aspergillus is chosen 
from the group comprising A . oryzae, A. niger, A. awamori. 

22. A method of claim 20, wherein said Trichoderma is chosen 
from the group comprising T. reseei . 

23. A method of any of the claims 1 to 16, wherein said cell 
is a mammalian cell chosen from the group comprising BHK cells or 
an insect cell. 
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